Diversity is All You Need: Learning Skills without a Reward Function

Authors

  • Benjamin Eysenbach
  • Abhishek Gupta
  • Julian Ibarz
  • Sergey Levine
Abstract

Intelligent creatures can explore their environments and learn useful skills without supervision. In this paper, we propose DIAYN (“Diversity is All You Need”), a method for learning useful skills without a reward function. Our proposed method learns skills by maximizing an information theoretic objective using a maximum entropy policy. On a variety of simulated robotic tasks, we show that this simple objective results in the unsupervised emergence of diverse skills, such as walking and jumping. In a number of reinforcement learning benchmark environments, our method is able to learn a skill that solves the benchmark task despite never receiving the true task reward. In these environments, some of the learned skills correspond to solving the task, and each skill that solves the task does so in a distinct manner. Our results suggest that unsupervised discovery of skills can serve as an effective pretraining mechanism for overcoming challenges of exploration and data efficiency in reinforcement learning.
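
As a rough illustration of the "information theoretic objective" mentioned in the abstract: the full paper describes a pseudo-reward of the form log q(z|s) - log p(z), where a learned discriminator q tries to identify the active skill z from the visited state s. The sketch below is not taken from this page. It assumes a uniform prior over skills and takes raw discriminator logits as input; the function names `log_softmax` and `diversity_reward` are hypothetical, and the maximum entropy policy update itself is left out.

```python
import numpy as np


def log_softmax(logits):
    """Numerically stable log-softmax over the last axis."""
    shifted = logits - np.max(logits, axis=-1, keepdims=True)
    return shifted - np.log(np.sum(np.exp(shifted), axis=-1, keepdims=True))


def diversity_reward(discriminator_logits, skill, num_skills):
    """Pseudo-reward log q(z|s) - log p(z) for one state.

    Assumptions (not from the source page): p(z) is uniform over
    `num_skills` skills, and `discriminator_logits` holds the
    discriminator's unnormalized scores for each skill at this state.
    """
    logits = np.asarray(discriminator_logits, dtype=float)
    log_q = log_softmax(logits)[skill]   # log q(z|s) for the active skill
    log_p = -np.log(num_skills)          # log p(z) under a uniform prior
    return float(log_q - log_p)
```

Under these assumptions the reward is zero when the discriminator cannot tell the skills apart and approaches log(num_skills) when the active skill is fully identifiable from the state, which is what pushes skills toward visiting distinguishable states.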

Similar resources

Effective Factors on Children's Selective Trust in Other's Testimony

Children live socially from birth to adulthood, and learning is an integral part of their lives. They cannot acquire the knowledge and skills needed for life without learning. However, childhood is not long enough to learn all of the massive amount of information and skills required for living in this world as adults, and children aren't able to acquire the whole of knowledge and skills ...

P14: How to Find a Talent?

Talents may be artistic or technical, mental or physical, personal or social. You can be a talented introvert or a talented extrovert. Learning to look for your talents in the right places and building those talents into skills and abilities might take some work, but going about it creatively will let you explore your natural abilities and find your innate talents. You’re not going to fin...

Reinforcement Learning with Human Feedback in Mountain Car

As computational agents are increasingly used beyond research labs, their success will depend on their ability to learn new skills and adapt to their dynamic, complex environments. If human users without programming skills can transfer their task knowledge to the agents, learning rates can increase dramatically, reducing costly trials. The TAMER framework guides the design of agents whose be...

P25: Talent and Perseverance

Many people think that all you need to succeed at anything is talent, but talent alone, without perseverance and determination, cannot help you achieve success. Talent is helpful, but perseverance ensures that one achieves success. A child can show an exceptional talent for storytelling, but if he ignores his teacher’s comments and doesn’t work on his stories, he will never be a great novel...

Learning Roles: Behavioral Diversity in Robot Teams

This paper describes research investigating behavioral specialization in learning robot teams. Each agent is provided a common set of skills (motor schema-based behavioral assemblages) from which it builds a task-achieving strategy using reinforcement learning. The agents learn individually to activate particular behavioral assemblages given their current situation and a reward signal. The exper...

Journal:
  • CoRR

Volume: abs/1802.06070  Issue:

Pages: -

Publication date: 2018